Search results for "Search engine"
showing 10 items of 121 documents
Large Scale Knowledge Matching with Balanced Efficiency-Effectiveness Using LSH Forest
2017
Evolving Knowledge Ecosystems were proposed to approach the Big Data challenge, following the hypothesis that knowledge evolves in a way similar to biological systems. Therefore, the inner workings of the knowledge ecosystem can be inferred from natural evolution. An evolving knowledge ecosystem consists of Knowledge Organisms, which form a representation of the knowledge, and the environment in which they reside. The environment consists of contexts, which are composed of so-called knowledge tokens. These tokens are ontological fragments extracted from information tokens, which in turn originate from the streams of information flowing into the ecosystem. In this article we investigate the u…
Production and Characterization of Medium-Sized and Short Antioxidant Peptides from Soy Flour-Simulated Gastrointestinal Hydrolysate.
2021
Soybeans (Glycine max) are an excellent source of dietary proteins and peptides with potential biological activities, such as antihypertensive, anti-cholesterol, and antioxidant activity.
The 100 most-cited articles in orthodontics: A bibliometric study
2018
Objectives: To identify and analyze the 100 most-cited articles in orthodontics indexed in the Web of Science Category of “Dental, Oral Surgery and Medicine” from 1946 to 2016. Materials and Methods: One hundred articles were identified in a search of the database of the ISI Web of Science and Journal Citation Reports, applying the truncated search term “orthodon*.” Records were manually refined and normalized to unify terms and to remove typographical, transcription, and/or indexing errors. Results: The 100 most-cited articles were published between 1946 and 2012, with numbers of citations ranging from 115 to 881. Of the 251 authors participating, 87.65% published a single work, wh…
Effectively and efficiently supporting crowd-enabled databases via NoSQL paradigms
2013
In this paper we provide an overview of the Hints From the Crowd (HFC) project, whose main goal is to build a NoSQL database system for large collections of product reviews; the database is queried by expressing a natural language sentence; the result is a list of products ranked based on the relevance of reviews w.r.t. the natural language sentence. The best ranked products in the result list can be seen as the best hints for the user based on crowd opinions (the reviews). The HFC prototype has been developed as a web application, independent of the particular application domain of the collected product reviews. Queries are performed by evaluating a text-based ranking metric for sets of re…
Indexing Multimedia Learning Materials in Ultimate Course Search
2016
Multimedia is the main support for online learning materials, and the size of multimedia learning materials is growing with the popularity of online programs offered by universities. Ultimate Course Search (UCS) is a tool that aims to provide efficient search of course materials. UCS integrates slides, lecture videos and textbook content into a single platform with search capabilities. The keywords extracted from the textbook index and the PowerPoint slides are the basis of the indexing scheme. The slides are indexed on the keywords and the videos are indexed on the slides. The correspondence between the slides and video segments is established using the meta-data pr…
Languages with mismatches
2007
In this paper we study some combinatorial properties of a class of languages that represent sets of words occurring in a text S up to some errors. More precisely, we consider sets of words that occur in a text S with k mismatches in any window of size r. The study of this class of languages mainly focuses both on a parameter, called repetition index, and on the set of the minimal forbidden words of the language of factors of S with errors. The repetition index of a string S is defined as the smallest integer such that all strings of this length occur at most in a unique position of the text S up to errors. We prove that there is a strong relation between the repetition index of S an…
Combining textual and visual cues for content-based image retrieval on the World Wide Web
2002
A system is proposed that combines textual and visual statistics in a single index vector for content-based search of a WWW image database. Textual statistics are captured in vector form using latent semantic indexing (LSI) based on text in the containing HTML document. Visual statistics are captured in vector form using color and orientation histograms. By using an integrated approach, it becomes possible to take advantage of possible statistical couplings between the content of the document (latent semantic content) and the contents of images (visual statistics). The combined approach allows improved performance in conducting content-based search. Search performance experiments are report…
RepeatsDB 2.0: improved annotation, classification, search and visualization of repeat protein structures
2017
RepeatsDB 2.0 (URL: http://repeatsdb.bio.unipd.it/) is an update of the database of annotated tandem repeat protein structures. Repeat proteins are a widespread class of non-globular proteins carrying heterogeneous functions involved in several diseases. Here we provide a new version of RepeatsDB with an improved classification schema including high quality annotations for ∼5400 protein structures. RepeatsDB 2.0 features information on start and end positions for the repeat regions and units for all entries. The extensive growth of repeat unit characterization was possible by applying the novel ReUPred annotation method over the entire Protein Data Bank, with data quality guaranteed by a…
Disability assessment using Google Maps.
2021
Objectives: To evaluate the concordance between Google Maps® application (GM®) and clinical practice measurements of ambulatory function (e.g., Ambulation Score (AS) and respective Expanded Disability Status Scale (EDSS)) in people with multiple sclerosis (pwMS). Materials and methods: This is a cross-sectional multicenter study. AS and EDSS were calculated using GM® and routine clinical methods; the correspondence between the two methods was assessed. A multinomial logistic model was used to investigate which demographic (age, sex) and clinical features (e.g., disease subtype, fatigue, depression) might have influenced discrepancies between the two methods. Results: Two hundred forty-three pwMS were …
Reverse-safe data structures for text indexing
2021
We introduce the notion of reverse-safe data structures. These are data structures that prevent the reconstruction of the data they encode (i.e., they cannot be easily reversed). A data structure D is called z-reverse-safe when there exist at least z datasets with the same set of answers as the ones stored by D. The main challenge is to ensure that D stores as many answers to useful queries as possible, is constructed efficiently, and has size close to the size of the original dataset it encodes. Given a text of length n and an integer z, we propose an algorithm which constructs a z-reverse-safe data structure that has size O(n) and answers pattern matching queries of length at most d optim…